Building a Data Lake with Spark and Iceberg at Home to over-complicate shopping for a House 2021-12-03: How I build what is essentially a self-service Data Lake at home to narrow down the search area for a new house, instead of using Zillow like a normal person, using Spark, Iceberg, and Python.
scala
spark
iceberg
python
sql
trino
geopandas
big data
hadoop
hive
presto
geospatial data
analytics